Data Processing Inequality
Formula of diminishing influence.
Formula
X→Y→Z⇒I(X;Y)≥I(X;Z)Proof
I(X;Y,Z)∵I(X;Z∣Y)∴I(X;Y)=I(X;Z)+I(X;Y∣Z)=I(X;Y)+I(X;Z∣Y)=0(logP[X∣Y]⋅P[Z∣Y]P[X;Z∣Y]=logP[X∣Y]⋅P[Z∣Y]P[Z∣X,Y]⋅P[X∣Y]=log1=0)=I(X;Z)+I(X;Y∣Z)≥I(X;Z)Usage
I(X;Z)⟹H(X∣Z)=H(X)−H(X∣Z)≤H(X)−H(X∣Y)=I(X;Y)≥H(X∣Y)Interpretation
- Z has its own freedom. But part of it comes from Y.
- I(X;Z∣Y)=0: No additional information gained for X from Z if Y is known.
- I(X;Z): Degree of relevance. Information gained from Z for X.
- Less than the information that can be gained from Y.
- H(X∣Z): Uncertainty of X with additional information from Z.
- Greater than that when having additional information from Y.